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1. INTRODUCTION 

In recent work [2], a microphone array consisting of an omnidirectional microphone and 
colocated dipole microphones having orthogonally aligned dipole axes was used to ex- 
amine the directional nature of a room impulse response. The arrival of significant 
reflections was indicated by peaks in the power of the omnidirectional microphone re- 
sponse; reflection direction of arrival was revealed by comparing zero-lag crosscorrelations 
between the omnidirectional response and the dipole responses to the omnidirectiona 
response power to estimate arrival direction cosines with respect to the dipole axes. 

Ideally, a dipole microphone amplifies an incoming signal according to the cosine o 
its arrival angle 9 with respect to the dipole axis <Pd* 

o,d( 0) = cos{9 - v?d)- 

As a result, crosscorrelations between the omnidirectional microphone response rriM{t) 
and the dipole responses m Di (t) normalized by the omnidirectional response power, 

termed directional fractions, 

t+S/2 , t +^/ 2 

CoiMit) = 5Z m Di( r ) Tn w( r )/ m M( T ) m w( T )i 

r=t-S/2 r=t—S/2 

approximate source direction cosines with respect to the dipole axes for the dominate 
signal present in the analysis window [t - 6/2 , t + 6/2}. In [2], direction of arr.val with 
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respect to any pair of axes i and j was estimated for each analysis window as the angle 
of the measured direction cosines vector, 

6e = arctan(Ci),Mi Cd^m)- ^ 

For purposes of room acoustic analysis, however, this processing method has draw- 
backs which limit its usefulness. In particular, when arrivals are not well separated in 
t j me — as i s the case for all but the first few specular reflections in a typical room the 
direction of arrival estimates produced can be inaccurate. Further, measurements made 
using a microphone array with three orthogonally oriented dipole elements may lack 
the spatial information needed to provide psychoacoustically transparent synthesis of a 

measured room. , 

In [1] the performance of the direction of arrival estimate (3) was examined in the 
case of a signal s{t),t = 0, 1, ... ,T - 1 arriving at the microphone array in the presence 
of additive measurement noise with independent identically distributed samples having 
zero mean and covariance a 2 . The direction of arrival estimate (3) was found to be 
unbiased with variance given in units of radians squared by 

Var W = T-k' SNR= ^§ s(t)2/<72 ’ (4) 

where T is the signal duration and SNR is the measurement signal-to-noise ratio. Ac- 
cordingly, for an arrival to be localized to within, for example, 10 degrees on average, 
it must appear with a total energy roughly 30 dB above the average noise background. 
In addition, if other reflections are present in the "noise" background, the directional 
fractions become weighted sums of arrival angle direction cosines, which are dependent 
on crosscorrelations among the reflection signals. The resulting angle of arrival estimate 
may bear no resemblance to the arrival angle of any of the reflections present. 

In this paper, a maximum likelihood estimator and likelihood ratio detector are pre- 
sented which can detect reflections and estimate arrival angles for multiple simultane- 
ously arriving reflections. Rather than determining a single direction of arrival for each 
analysis window, as proposed in [2], the approach presented here forms a function of 
arrival direction at each point in time. The function is designed to indicate the power 
impinging the array as a function of direction of arrival, and will peak at the arrival 
angles of any reflections present. In this way, simultaneously arriving reflections appear 

as distinct maxima, separated in arrival angle. 

We begin by studying the localization information contained in monopole-dipole 
microphone array room measurements. The information inequality is used to place a 
lower bound on the variance of arrival angle estimates as a function of array and room 
characteristics. The use of additional dipole elements is shown to increase angle of 
arrival estimate accuracy, particularly in the presence of overlapping reflections. Finally, 
a likelihood function-based room analysis method is described, and an example room 
measurement presented. 


2. LOCALIZATION ACCURACY 

Consider a set of J source signals Sj(t ) appearing at a monopole-dipole microphone 
array, the jth signal arriving from direction 0j. The microphone array consists of K 



colocated monopole and dipole elements which record signals m k (t) via antenna patterns 
a k {0 ) in the presence of additive measurement noise n k {t). The monopole elements 
pass all signals unchanged, a M (0j) = 1; whereas the dipole elements amplify incoming 
signals according to the cosine of the arrival angle with respect to the dipole axis (fD . 


a D (dj) = cos (dj - ip D ). The microphone signals are given by 

m k (t) = ^a k (6j) ■ Sj(t) +rii(t), t = 0, 1, . . . ,T — 1, (5) 

j=i 

and may be expressed in matrix form as 

M = SA t + N, ( 6 ) 

where M is the matrix of microphone responses m k = [m fc (0) ••• m k (T - 1)] ,A 

is the matrix of array steering columns a(9j ) = [ai(0j) • • * a f<(9j) ] > 

M — [mi ••• mj<], A=[a(0i) ••• a ($./)]> (7) 

S is the matrix of source signals Sj = [sj(0) • • • Sj{T - 1)] T . and N is^the matrix 

of microphone measurement noise samples n k = [n fc (0) • • • n k (T - 1) ] , 

S=[si ••• sj], N = [ni ••• n K \- ( 8 ) 


The measurement noise is assumed to be zero mean with Gaussian-distributed samples, 
and be independent and identically distributed microphone to microphone, p(N) = 
N(Q,I®X:), where T* represents the stack of the columns of its matrix argument, and 
<8) is the Kronecker product defined by 

’ f\\G ••• /irG 

f®g= i : ( 9 ) 

,fp\G SprG . 

F being a PxR matrix with elements f p r . The set of microphone responses is therefore 
Gaussian-distributed with a mean dependent on the unknown signals S and directions 

of arrival 0 = [#i ••• 0j] T , 

j 

p(A? ; e , S) = NW, S ), I ® 17), /*(», 5)=E a ( 0 i) ® *y (10) 


Information Inequality 

Of interest is the ability of the microphone array to determine the arrival direction of each 
of the signals present. In the absence of a particular processing method, the information 
inequality— often referred to as the Cramer-Rao bound [3, pp. 123-129]— can provide 
a gauge of the reflection information contained in array measurements by placing an 
upper limit on the accuracy with which directions of arrival may be determined using 
microphone array measurements. 



For the problem at hand, the information inequality states that the variance in esti- 
mating the unknown directions of arrival 0 and reflection signals S based on microphone 
responses A <f has a lower limit: 



where the direction of arrival and arrival signal estimates 6 and S are assumed to be 
unbiased, and Fq § is the Fisher infoTTfiution mo-tvix, given by 


?e,s = e 


dlnp(A?;0,S) 

d[e 5 t ] t 


^lnp(A?;g,g) 

d[e 5 T ] J 


( 12 ) 


where E{} represents expected value. 


Angle of Arrival Estimate Accuracy 

In view of (10) and (11), the covariance of arrival angle estimates is bounded below, 

Var {#} > A~ l + A~ l Y r (r - YA^Y^YA- 1 , (13) 

where T is the Kronecker product of the steering matrix inner product and the measure- 
ment noise covariance inverse, and A is the element-by-element product of the array 
steering matrix derivative inner product and source signal matrix inner product, 

r={A T A)®£~\ A = (D T D)o(S T S~ l S), Y=(r T D)®(Z- l S) > (14) 

where © represents the Khatri-Rao product defined for matrices F and G, each having 
J columns and g.j, j = 1, ••*,*/. by 

F qG = [f.i ®g.i f.j®9.j \ > ( 15 ) 

and where the steering matrix derivative is given by U — [ ••• eoj J; 

To explore the bound variance behavior as a function of source and array particulars, 
it is useful to first consider the case of a single arrival. 


Var 




(16) 


The bound variance is the inverse product of two terms: s T E l s which is the product 
of signal duration and signal-to-noise ratio, and a geometric factor roughly equal to one 
plus half of the number of dipole elements. In other words, more powerful signals may 
be more accurately localized than weaker ones. In addition, accuracy may be increased 
by including measurements from additional dipole elements. 

It should be pointed out that the arrival direction bound covariance is independent of 
signal direction of arrival when the dipole axes are uniformly spaced on [0, tt). However, 
the dipoles receiving the least signal energy provide the most direction of arrival infor- 
mation, as their antenna patterns are the most sensitive to direction of arrival changes. 



Similar results hold in the case of multiple sources. When T is nonsingular (i.e., 
when A has full column rank) the bound (13) can be written as 

Var {0} > [(D t P 1 a D) o (5 t ^7 -1 ^)] , P A = 1 ~ A{A r A)~ l A T , (17) 

where P A is a projection orthogonal to the columns of the array steering matrix A. 
Again, the bound variance is the inverse of a product of two terms, S T £ 1 S which 
is proportional the measurement duration, signal-to-noise ratio product, and D P A P 
which is roughly one plus half of the number of dipole elements used. ^ ^ 

When the arrival signals are uncorrelated, the signal inner product S S S becomes 
diagonal, and the direction of arrival bound variances approximate the corresponding 
single arrival bound variances. However, if the arrival signals are highly correlated, 
nearby arrivals with similar steering columns a(6j) will have bound variances which 
are noticeably larger than their corresponding single arrival variances. In addition, the 
number of highly correlated arrivals which may be simultaneously resolved is limited to 
the rank of D T P A D — one plus the number of dimensions for a monopole-dipole array. 


3. ARRIVAL DETECTION AND ESTIMATION 


The following presents an alternative detection-estimation scheme to (2), (3). In the 
additive Gaussian measurement noise case considered here, comparing the likelihood ratio 
to a threshold is known to provide the optimal tradeoff between arrival probability of 
detection and probability of false alarm. Further, in the limit of large Fisher information 
(that is, in the limit of small estimate errors) the so-called maximum likelihood estimator 
is unbiased with variance achieving the information inequality bound [3, §6-2]. 

Given microphone array measurements M, the maximum likelihood arrival direction 
and signal estimates 9ml and Sml are those maximizing the loglikelihood, 

(0ml> 5ml) = arg{m^<^(M; 9 , 5)}, (18) 


t{M\ 9, S ) = -i[/i(0, S ) - M] t (IVE l ) [#*(0» 5) - M] T . (19) 

The loglikelihood function may be written in terms of the arrival directions only by 
noting that the set of arrival signals S{0) = MA(A T A)~ l maximizes the loglikelihood 
function for any given arrival direction set 0. We have 

i{M\9) = -^tT{P ± A M r S- l MP L A }. (20) 

Rather than optimize the loglikelihood £{M\9) directly by searching over the set 
of arrival directions 6, here we compare at each arrival direction 6 the loglikelihood 
assuming J arrivals are present, one appearing at 6, with the loglikelihood assuming 
J — 1 arrivals 0 are present, none appearing at 9: 


a(9) T M T S~ l Ma(9) 

A(M; «) = mm 


£(M]9). 


( 21 ) 


The resulting loglikelihood ratio indicates received power as a function of arrival direction, 
and will peak at signal arrival directions. The optimization over 9 is difficult, and 




Figure 1: Normalized Likelihood Approximation Example. 

A(JVf ;0) is approximated as the power in the weighted sum of microphone signals, 
where the weighting is chosen to minimize the output power while passing unchanged 
signals arriving from the "look direction" 9. 

A (M; 9) = min 7 T M T i:" 1 M 7 s.t. a(0) T 7 = 1- (22) 

We conclude with an example measurement. A condensor microphone having a 
rotatable capsule (Neumann USM 69i) was placed in a long narrow hallway along with 
a sound source (Bose AM-Ill). The microphone and sound source were placed in the 
plane perpendicular to the long axis of the hallway, ensuring that all arrivals during the 
measurement period were also from that plane. Impulse responses were measured using 
Golay code pairs with the microphone set to receive signals omnidirectionally and via a 
dipole pattern at a number of dipole axis orientations. 

The loglikelihood ratio approximation (22) derived from the measured impulse re- 
sponses appears in Figure 1. All measured loglikelihood ratio maxima correspond to the 
arrival times and arrival directions predicted by a ray tracing of the room geometry. In 
results not shown here, the angular extent of the loglikelihood ratio maxima were seen to 
decrease with an increasing number of dipole elements. In addition, the likelihood ratio 
approximation was seen to place distinct maxima at the arrival angles of simultaneously 
arriving reflections. 
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